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On  the  Second  Moment  of  the  Remainder  Term  Appearing 
in  the  Intermediate  Order  Statistic  Representation 


by 

Vernon  V/atts 

ABSTRACT 

Under  certain  conditions  a sample  intermediate  order  statistic  from  a sequence 
of  independent  and  identically  distributed  random  variables  has  an  almost  sure 
representation  involving  the  empirical  distribution  and  a remainder  term  of  small 
order.  In  this  paper  an  asymptotic  approximation  of  the  second  moment  of  the 
remainder  term  is  obtained.  It  is  assumed  that  the  marginal  distribution  function 
of  the  independent  and  identically  distributed  sequence  has  a finite  left  endpoint. 
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1.  Introduction.  Suppose  that  are  independent  and  identically 

distributed  random  variables  on  (J2,  F,  P)  with  marginal  distribution  function 

(d.f.)  F(x)  «=  P(Xj  s x).  Let  I’®  integers  such  that  1 s s n for  each 

n and  * but  k^/n  -*•  0 as  n and  denote  by  the  smallest  of 

fnl  ^ 

^1*  *'*’  ^n*  ^^k  ^^nil  called  a sequence  of  intermediate  order  statistics. 

n 

Define  the  left  endpoint  x^  of  F by  Xq  = infix:  F(x)  > 0},  which  we  assume 
to  be  finite.  Let  Fj^Cx,  w)  = n"^  I(X^  s x)  be  the  empirical  d.f.  of  the 
sample  X^,  ...,  X^.  Then  under  certain  conditions  on  F,  has  the  representation 


(1.1) 


((0) 

n 


V" 


- F (x', 
n^  n’ 

F ' (x;) 


(d) 


r;  («*)). 


-1  1/4  3/4 

where  x^  satisfies  F(x^)  = k^/n  and  where  R^(u)  = 0(n  k^  log  n)  as  n • 
with  probability  one.  (See  Watts  (1977).)  In  this  paper  we  develop  an  asymptotic 
approximation  to  the  second  moment  of  the  remainder  term  retaining  the  same 
conditions  used  to  establish  (1.1)  and  imposing  some  additional  restrictions. 

Our  procedure  follows  that  employed  by  Duttweiler  (1973)  to  approximate  the 
second  moment  of  the  remainder  term  appearing  in  the  Baliadur  (1966)  representation 
of  sample  X-quantiles. 

Since  the  expected  value  of  the  k^^  smallest  order  statistic  of  a sample 

of  n variables  uniformly  distributed  on  the  unit  interval  (0,  1)  is  k/(n  ♦ 1) 

rather  than  k/n,  it  turns  out  that  a substantial  simplification  in  the  procedure 

is  effected  by  dealing  with  a representation  for  which  is  slightly  different 

n 

from  (1.1).  We  consider  instead 


(1.2) 


- X ♦ n_JL. 


- X„ 

n 


F'(Xn) 


* R„  (<*>). 


2 


where  is  defined  by  methods  used  to  prove  (1.1) 

-1  1/4  3/4 

also  show  that  R^jCu)  = 0(n  log  n)  with  probability  one. 

2.  Derivation  of  the  approximation.  The  following  preliminary  result, 
which  we  state  as  a lemma,  provides  an  exact  expression  for  the  second  moment 
of  when  F is  the  uniform  (0,  1)  distribution.  The  details  of  proof  are 
given  by  Duttweiler  (1973) . 

Lemma.  Let  be  the  smallest  order  statistic  for  the  set  U, , . . . , U of 
k 1’  ’ n 

n independent  random  variables  uniformly  distributed  on  (0,  1).  Define 


(2.1)  - k/(n  + 1)  + (IJ^j  I(U.  i k/(n  ♦ 1))  - n(l  - k/(n  + l)))/n 

and  let  R - . Then 


E(R^) 


n(n  ♦ 1) 


'k/(n+l) 


(k,  n + 1 


't/ta.!)'"  * 1. " ‘ 1 - lo)  - 


where  I^(a>  b)  is  the  incomplete  beta  function,  defined  by 


l3j(a,  b) 


Now  consider  a general  d.f.  F for  which  Xq  > Suppose  that  F(Xg)  ■ 0 
and  in  an  interval  (x^,  Xq  ♦ €)  F is  twice  differentiable  with  F”  boiinded,  and 
such  that  lim  F'(x)  exists  and  is  positive.  These  assumptions  insure  that  x 

X+Xq 

is  uniquely  determined  for  large  n.  If  k^^/log  n « as  n then  (1.2)  holds 
with  the  indicated  order  of  R^.  For  the  following  result  we  make  the  additional 
restriction  that  the  {X^}  have  a finite  second  moment,  that  is,  that  /x  dF(x)  < •», 
as  well  as  a slightly  stronger  requirement  on 
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Theorem.  Suppose  the  {X^}  have  marginal  d.f.  F satisfying  the  above  conditions 
0 

and  that  k /n  ■*■  ’»  for  some  9 > 0.  Then  for  R defined  by  (1.2), 
n n 

E(R^)  (nF' as  n -*■  ®. 


Proof.  Let  U, , . . . , U be  independent  uniform  fO.  I")  variables  with  k 
in  n 

r(n) 


th 


smallest  order  statistic  U, 


Define  the  quantile  function  Q(*)  by  Q(u) 


sup{x:  F(x)  s u}.  Then  each  Q(U^)  has  d.f.  F,  and 


R ^ QCU^"^)  - X - 
n k n 

n 


- i;.i  uQ(t't) . x„) 

nF'tx  ) 
n 


where  indicates  having  identical  distributions. 

We  may  suppose  that  F'(x)  is  positive  in  the  interval  A » (x^,  Xq  ♦ 6).  Let 

B » {u:  u ■ F(x),  x e A}.  Then  Q restricted  to  B is  the  inverse  of  F restricted 

to  A.  Let  p^  » k /(n  + 1).  There  is  an  integer  N.  such  that  p e B if  n i N, , 
nn  *^1  “^n  1 

and  we  have  that 


(2-2)  QCPj^)  = x^, 

(2.3)  ^Pn^  exists  and  equals  (F'(x^))'^, 
and 

(2.4)  0(u)  s x^  if  and  only  if  u s p^. 

Also,  Q"  exists  and  is  bounded  in  B,  and  a straightforward  derivation  shows  that 

/J  Q^(u)  du  » ExJ  < -. 

For  n 2 1 and  u e [0,  1]  define 

H„(u)  - Q(u)  - Q(p^)  - Q'(p„)(u  - p^). 
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Then  by  (2. 2) -(2. 4)  for  n ^ we  have  that 


' 1(P„)  • - p„)  • 


(2.5) 


n n n 

^ Cx\\ 

where  ■'  is  given  by  (2.1).  Now  by  Lemma  1, 
n 

2k 

ECuM  . OW,'  . n . 1 - E^,  - . 1,  „ . 1 . E„))  . OCk„/n’) 

Applying  the  relation 


I,(a,  b)  - I,(a  . 1.  b)  . - X)' 


for  positive  integers  a and  b (see  Abramowitz  and  Stegun  (1964,  Equation 
26.5.16))  and  Stirling's  formula 

n!  - e'"  n"*^/^(2iT)^/2(l  +0(n’b), 

we  obtain 

(2.6)  - uf"^)2  . (2/n)^^^(k^^^W)(l  ♦ o(l)). 

n n 

Therefore  to  complete  the  proof  of  the  theorem  it  is  sufficient  from  (2.5)  and 

(2.6)  and  by  the  Schwarz  inequality  to  show  that 

(2.7)  E(Kj(uj^"h)  - o(kyW). 

n 

Choose  o,  0 < o < 1/8,  and  let  e ■ k°/n^^^  and  I ■ (maxCO,  p -e},  p ♦€). 

n n n "^n  n n n 

r'e  nay  assume  that  I c B.  Denoting  the  probability  density  of  by  g , we 

n k n 


have 
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E = ( / * / ) H;J(u)  g^Cu)  du. 

n UE I ui I 

n ^ n 


Let  H 


= sup  h (u)  and  g „ 

n.nax  % ' n'-  ' ^n,max  i',  ‘'n 

* ueI  ’ uil 

n * n 


sup  g^  (u) . Then 
n 


E K^Cuf"^)  s + g /i  H^(u)  du. 

n^  k n,max  ®n,max  ^0  n'- 


2 2 2 2 

Also,  it  follows  from  the  inequality  (a+b+c)/3  sa  ♦b  ♦c  that 


(l/3)/J  H^(u)  du  s E ♦ Q^(p^)  ♦ (Q’(P„))‘ 


Since  E Xj  < «,  by  (2.2)  and  (2.3),  and  since  F'(x^)  tends  to  a non-zero  limit, 
there  is  a constant  < <»  such  that  for  n 2 Nj, 

/J  h2(u)  du  s Cj. 

Hence  for  n 2 N^, 

(2.8)  E H^(ui"^)  s ♦ C,  g 

n'  n,max  1 *'n,max 

n ' 

Then  letting  C2  ■ su^  (|Q’'(u)|}  < «,  we  have  by  Taylor's  expansion  that 
|H^(u)|  s ^^(u  - p^)  /2  for  u e I^,  and  therefore 

(2-9)  Hn  m»v  ^ E^/d 

H f 1D21X  2 H 


if  n 2 Mj. 


Next,  we  observe  that  the  density 


k -1 


n-k 


• <k  - l>?(n  - t )i  ""  »-")  0<U<1, 


has  modem  - (k  - 
n n 


l)/(n  - 1)  and  decreases  monotonically  on  both  sides.  Since 
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l"*n  ■ Pnl  “ “n  ^ n i ^ N^.  Let  tc  ^ 1.  'Ve  have 

E^k  • Pn^  ^ L - Pn^ 


^ ®n'P,.  * ‘ «■■(./"'  - (m„  - P„)^''*'), 


and  since  ne^^  -►  <»  whereas  !"„-?„  = 0(n”^)»  there  exists  C^  < «,  depending 
K,  such  that 

' 'n’  ^ S - Pn'^" 

n 

for  n ^ Nj  s N2.  In  a similar  manner  there  exists  < “ such  that 
i’n'Pn  - 'n)  ^ ^4  ' P-)^" 


ECU-"  - P„r 


for  n i Nj.  Then  letting  = max{C2,  C^}  gives 

*n,»ax  = S 

n 

Since  (see  Blom  (1958,  p.  42))  there  is  a constant  < <»  independent  of  n,  k^, 
and  K such  that 

_ \2ic  _ -K 


it  follows  that 


(2.10) 


E(U^  - p^)  ^ Cg  n , 


a see  n”*^  -“(^if*!) 
®n,max  ^6  ^5  " ^n 


for  n i Nj.  Then  (2.8),  (2.9),  and  (2.10)  lead  to 

E ^ cl  e^/4  4 e e e n”' 

nk^  Zn  165  n 
n 

fX 

for  n 2 N . Finally  we  may  suppose  that  k 2 n for  some  6 > 0.  so  that 
^ n 

^-(2ic+l)  ^ ^l/2-ea(2K+l) 

n 

and  by  choosing  k sufficiently  large  we  obtain  (2.7).  □ 
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